Modelling speaking rate using a between frame distance metric
نویسندگان
چکیده
It is well known [5] that variations in speaking rate can account for a signi cant percentage of errors in practical speech recognition tasks. This is the result of the dynamic nature of speech which is not modelled properly by the standard HMM structure. This paper proposes an extension to the standard HMM that takes advantage of the information about the rate of speech that is contained in inter-frame transitions. The new model can be seen as a combination of Moore and Mealy type HMM's that has output probabilities attached to the transitions between states in addition to the conventional output probabilities attached to states. In this model fast and slow transitions are associated with additional hidden parameters. The output probabilities of the transitions are modelled with gamma distributions.
منابع مشابه
یادگیری نیمه نظارتی کرنل مرکب با استفاده از تکنیکهای یادگیری معیار فاصله
Distance metric has a key role in many machine learning and computer vision algorithms so that choosing an appropriate distance metric has a direct effect on the performance of such algorithms. Recently, distance metric learning using labeled data or other available supervisory information has become a very active research area in machine learning applications. Studies in this area have shown t...
متن کاملMATHEMATICAL MODELLING OF THE EFFECT OF FOAM DEGRADATION ON MOULD FILLING IN THE GREY IRON EPC PROCESS
In this investigation a new model was developed to calculate gas pressure at the melt/foam interface (Gap) resulting from foam degradation during mould filling in the Lost Foam Casting (LFC) process. Different aspects of the process, such as foam degradation, gas elimination, transient mass, heat transfer, and permeability of the refractory coating were incorporated into this model. A Computati...
متن کاملA CHARACTERIZATION FOR METRIC TWO-DIMENSIONAL GRAPHS AND THEIR ENUMERATION
The textit{metric dimension} of a connected graph $G$ is the minimum number of vertices in a subset $B$ of $G$ such that all other vertices are uniquely determined by their distances to the vertices in $B$. In this case, $B$ is called a textit{metric basis} for $G$. The textit{basic distance} of a metric two dimensional graph $G$ is the distance between the elements of $B$. Givi...
متن کاملFixed point theorems under c-distance in ordered cone metric space
Recently, Cho et al. [Y. J. Cho, R. Saadati, S. H. Wang, Common xed point theorems on generalized distance in ordered cone metric spaces, Comput. Math. Appl. 61 (2011) 1254-1260] dened the concept of the c-distance in a cone metric space and proved some xed point theorems on c-distance. In this paper, we prove some new xed point and common xed point theorems by using the distance in ordered con...
متن کاملAssamese Vowel Phoneme Recognition Using Zero Crossing Rate and Short-time Energy
Speaker recognition is the identification of the person who is speaking by the characteristics of their voices. Assamese is a Indo-Aryan family of languages, mainly spoken in the North-Eastern of India. In this paper text dependent speaker modelling technique is used. The system contains training phase, the testing phase and the recognition phase. The database consists of utterance of 10 speake...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999